This report is for ETC5521 Assignment 1 by Team brolga, comprising Dilinie Seimon and Diyao Chen.

1 Introduction and motivation

Animal Crossing: New Horizons is a life simulation video game developed and published by Nintendo for the Nintendo Switch. It was released worldwide on the 20th of March 2020. Since its inception the game has had an astounding worldwide reception, with over 22 million copies sold in just 5 months.

In the Animal Crossing world, a player takes the role of a customized human character who moves to a deserted island and carries out various activities such as gathering and crafting items, fishing and bug hunting in a village inhabited by various species of animals. Each of these animals, called villagers in the Animal Crossing world, has its own name, gender, birthday, personality, favorite song and catchphrase. The items used to perform different tasks in Animal Crossing belong to different categories and have different buying and selling values.

The game also simulates day and night based on a 24-hour clock, with different animals and insects appearing at specific times of day or night. The level of detail in the design of the Animal Crossing world has attracted many users, who play throughout the day.

Although it has been only 5 months since the game was released worldwide, it has been gaining a lot of attention from both players and critics. Even non-players may find the concept of this game well designed and interesting.

The motivation for choosing Animal Crossing for the analysis is to attempt to understand the reason for the immense popularity of the game using user and critic reviews and features within the game (such as the villagers and the items).

Therefore, the analysis of the Animal Crossing: New Horizons game is subdivided into two broad areas and answers the following sub-questions.

An analysis of the villagers and items used in the Animal Crossing world

What are players and critics saying about Animal Crossing?

2 Data description

The dataset used for this analysis was retrieved from TidyTuesday, a project that allows individuals to practice their data wrangling and visualization skills on real-world datasets.

The retrieved dataset consists of four sub-datasets: the ‘villagers’ and ‘items’ datasets, containing data about in-game characters and items, and the ‘user reviews’ and ‘critic reviews’ datasets, containing user and critic reviews of the game.

The ‘villagers’ and ‘items’ datasets were originally retrieved from VillagerDB, a project aimed at making data about Animal Crossing available and easily accessible, while the user and critic reviews were originally retrieved by scraping the Metacritic website.

2.0.1 Data dictionary

Villagers

The ‘Villagers’ dataset consists of data related to the characters in the Animal Crossing game world. The following are the variables in the villagers dataset and their descriptions.

Variable Description
row_n Numeric identifier of villager
id Short text identifier of villager
name Name of villager
gender Gender of villager
species Species of villager
birthday Birthday of villager
personality Personality of villager
song Song associated with villager
phrase Catchphrase of the villager
full_id Full text identifier of villager
url Link to image of villager

Items

The ‘Items’ dataset consists of data related to the items in the Animal Crossing game world. The following are the variables in the items dataset and their descriptions.

Variable Description
num_id Numeric identifier of item
id Character identifier of item
name Name of item
category Category of item
orderable Orderable from catalog
sell_value Selling value
sell_currency Selling currency
buy_value Buying value
buy_currency Buying currency
sources Way/place to acquire item
customizable Is item customizable
recipe Recipe of the item (materials it is made of)
recipe_id Recipe ID
games_id Game ID
id_full Full Character ID
image_url Link to image of item

User Reviews

The ‘User Reviews’ dataset consists of the scores and reviews made by users from 2020-03-20 to 2020-05-03.

Variable Description
grade Raw score (0-10) given; 0 is the lowest and 10 the highest
user_name User name of the reviewer
text Raw text of the review
date Date the review was published

Critic Reviews

The ‘Critic Reviews’ dataset consists of the scores and reviews made by critics about the game from 2020-03-16 to 2020-05-01.

Variable Description
grade Raw score (0-100) given; 0 is the lowest and 100 the highest
publication The source of the reviewer
text Raw text of the review
date Date the review was published

3 Data Exploration and Wrangling

3.1 Handling Missing Values

Interestingly, the buying value is missing for 22.21% of the items. To analyse this further, the percentage of missing buying values in each item category was calculated.

Figure 3.1: Percentage of missing buying values in each category of items

Figure 3.1 shows that all buying values of fish, fossils and seashells are missing.

Further research indicated that fish, fossils and seashells cannot be bought, which explains the missing buying values. A blog on Animal Crossing states that fish can only be acquired by fishing and cannot be bought (“Animal Crossing: New Horizons Fish Guide: How, When and Where to Catch All the Fish” n.d.). Further, a fan page on Animal Crossing states that fossils can only be dug up and seashells can be collected (“Animal Crossing Wiki” n.d.). The remaining missing buying values were likewise attributed to those items not being purchasable in the Animal Crossing world.
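
The per-category calculation described above can be sketched as follows. This is a minimal Python illustration, not the code used for the report; the sample rows are made up, and only the column names `category` and `buy_value` come from the data dictionary.

```python
from collections import defaultdict


def missing_share_by_category(rows, value_key="buy_value"):
    """Return {category: percentage of rows whose value is missing (None)}."""
    totals = defaultdict(int)
    missing = defaultdict(int)
    for row in rows:
        totals[row["category"]] += 1
        if row.get(value_key) is None:
            missing[row["category"]] += 1
    return {cat: 100.0 * missing[cat] / totals[cat] for cat in totals}


# Made-up rows for illustration only; they are not taken from the dataset.
sample_items = [
    {"category": "Fish", "buy_value": None},
    {"category": "Fish", "buy_value": None},
    {"category": "Furniture", "buy_value": 1200},
    {"category": "Furniture", "buy_value": None},
]

shares = missing_share_by_category(sample_items)
for category, share in sorted(shares.items()):
    print(f"{category}: {share:.1f}% missing")
```

A category in which every item lacks a buying value, as the report finds for fish, fossils and seashells, would show a share of 100%.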

Columns with over 80% missing values were dropped from the analysis because their values could not be imputed accurately. This did not affect the analysis, as each dropped variable is independent of the remaining variables.
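
The column-dropping rule can be illustrated with a short Python sketch. It assumes rows are held as dictionaries with `None` marking a missing value; the helper name `drop_sparse_columns` and the sample rows are hypothetical, and only the 80% threshold comes from the text.

```python
def drop_sparse_columns(rows, threshold=0.8):
    """Drop columns whose share of missing (None) values is over the threshold."""
    if not rows:
        return []
    # Collect column names in first-seen order across all rows.
    columns = []
    for row in rows:
        for col in row:
            if col not in columns:
                columns.append(col)
    n = len(rows)
    keep = [
        col for col in columns
        if sum(1 for row in rows if row.get(col) is None) / n <= threshold
    ]
    return [{col: row.get(col) for col in keep} for row in rows]


# Made-up rows: 'recipe' is missing in all five rows, 'sell_value' in one.
sample_items = [
    {"name": "item_a", "sell_value": 100, "recipe": None},
    {"name": "item_b", "sell_value": None, "recipe": None},
    {"name": "item_c", "sell_value": 250, "recipe": None},
    {"name": "item_d", "sell_value": 40, "recipe": None},
    {"name": "item_e", "sell_value": 900, "recipe": None},
]

cleaned = drop_sparse_columns(sample_items)
print(sorted(cleaned[0].keys()))
```

Here `recipe` is removed because all of its values are missing, while `sell_value` is kept because only one value in five is missing.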

3.2 Currency conversion

The buying and selling values of items in the Animal Crossing world are expressed in two currencies: Bells and Miles. To simplify the analysis, all buying and selling prices were converted into Bells.

The Nintendo guide states that a Bell Voucher can be bought for 500 Nook Miles and can thereafter be exchanged for 3000 Bells in the Animal Crossing world. Therefore, it was assumed that each Mile equates to 6 Bells (3000 / 500) in the currency conversion.
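
The conversion can be written out as a small Python sketch. The rate follows from the Bell Voucher exchange described above (3000 Bells for 500 Miles); the function name `to_bells` and the lowercase currency labels `"bells"` and `"miles"` are assumptions made for illustration.

```python
# 3000 Bells per 500 Nook Miles gives 6 Bells per Mile.
BELLS_PER_MILE = 3000 // 500


def to_bells(value, currency):
    """Convert a price to Bells; missing prices stay missing (None)."""
    if value is None:
        return None
    label = currency.strip().lower()
    if label == "bells":
        return value
    if label == "miles":
        return value * BELLS_PER_MILE
    raise ValueError(f"unknown currency: {currency!r}")


print(to_bells(500, "miles"))   # one Bell Voucher's worth of Miles
print(to_bells(1200, "bells"))  # already in Bells, returned unchanged
```

Keeping missing prices as `None` rather than converting them to 0 avoids treating items that cannot be bought as free.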

4 Analysis and findings

4.1 An analysis of the villagers in the Animal Crossing World

Animal Crossing is a very popular game with a unique style of play. This section examines its distinctive villager system.

A non-player character (NPC) is an essential part of most games and typically guides the player through the game. In Animal Crossing, NPCs take the form of villagers, who not only teach the player how to play but also keep the player company in the game world. There are 391 villagers in the game.

4.1.1 Analysis of the species of villagers

Table: Ranking of villager species
species count percent
cat 23 6
rabbit 20 5
squirrel 18 5
frog 18 5
duck 17 4
cub 16 4
dog 16 4
pig 15 4
mouse 15 4
horse 15 4

Based on the ranking of species above, cats, rabbits and squirrels are the three most common species in Animal Crossing, making up about 6%, 5% and 5% of villagers respectively (frogs are tied with squirrels). These animals are also very popular in everyday life. The game designers appear to have understood what players like and created many characters based on well-loved animals, which helps players become attached to the game. Each character is eye-catching, which is part of the game's charm and may be one reason for its popularity.

4.1.2 Analysis of the personality of villagers

According to the analysis of personality, there are eight different personalities in the game, of which lazy is the most common. Since the theme of the game is leisure, it is understandable that the lazy personality ranks first: it helps the player feel free and relaxed. This careful design lets the player become immersed in the game and is likely one of the reasons for its success.

4.1.3 What are the most common star signs of the villagers based on their birthdates?

Table 4.1: Number of villager birthdays in each month
birth_month count
10 37
8 36
7 35
12 34
6 33
3 33
1 32
9 32
5 31
11 30

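Based on Table 4.1, most villagers have birthdays in October, followed by August. This suggests that Libra, which covers most of October, may be the most common star sign. However, each star sign spans parts of two calendar months, so confirming this would require mapping each birthday to its star sign rather than counting by month.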
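
Because star signs do not line up with calendar months, counting by sign needs a date-to-sign mapping. The Python sketch below shows one way to do this. It assumes birthdays are stored as `month-day` strings such as `10-5`, and it uses one commonly quoted set of boundary dates (sources differ by a day at some boundaries). The sample birthdays are made up and are not villager data.

```python
from collections import Counter

# Last day of each sign in calendar order (one common convention; an assumption).
SIGN_END_DATES = [
    ((1, 19), "Capricorn"),
    ((2, 18), "Aquarius"),
    ((3, 20), "Pisces"),
    ((4, 19), "Aries"),
    ((5, 20), "Taurus"),
    ((6, 20), "Gemini"),
    ((7, 22), "Cancer"),
    ((8, 22), "Leo"),
    ((9, 22), "Virgo"),
    ((10, 22), "Libra"),
    ((11, 21), "Scorpio"),
    ((12, 21), "Sagittarius"),
    ((12, 31), "Capricorn"),
]


def star_sign(birthday):
    """Map a 'month-day' string such as '10-5' to a star sign."""
    month, day = (int(part) for part in birthday.split("-"))
    for end_date, sign in SIGN_END_DATES:
        # Tuples compare by month first, then by day.
        if (month, day) <= end_date:
            return sign
    raise ValueError(f"not a valid month-day value: {birthday!r}")


# Made-up birthdays for illustration only.
sample_birthdays = ["10-5", "10-25", "8-1", "9-30"]
sign_counts = Counter(star_sign(b) for b in sample_birthdays)
print(sign_counts.most_common())
```

In this made-up sample, two October birthdays fall under different signs (Libra and Scorpio), which is why a month count alone cannot settle the question.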

The villagers in Animal Crossing are of different species and also have their own gender, birthdate, personality, favorite song and unique catch phrase.

Figure 4.1: The number of villagers belonging to each species

As visualized in Figure 4.1, there are 391 villagers belonging to 35 different species in the Animal Crossing world. Cats, rabbits, frogs and squirrels are among the most common species, while bulls, rhinos, cows and octopuses are relatively uncommon. Another interesting observation can be seen in the breakdown of each species by gender. Most species have both male and female villagers, but all bulls and lions are male while all cows are female. This may be due to the terms ‘cow’ and ‘bull’ being gender specific, cows being the female counterpart of bulls. This, however, does not explain the absence of female lions in the Animal Crossing world.

4.2 An analysis of the Items in the Animal Crossing World

The Animal Crossing world has many different items that assist the villagers in tasks such as building houses, fishing, bug hunting and digging. Each of these items has its own characteristics and belongs to one of 21 categories.

Figure 4.2 visualizes the number of items belonging to each of the item categories. Most of the items in Animal Crossing are furniture items, and there are also a lot of photos. The fruit and seashell categories contain the fewest items, and it is interesting to see more types of umbrellas and socks than fruits. The Animal Crossing world seems very well equipped, with item categories ranging from flooring to umbrellas.

Figure 4.2: Count of items belonging to each category

Buying prices of Fish and Fossils were not available

Name Category Selling Price Buying Price Profit

4.3 What do the users say?

The analysis of user feedback on Animal Crossing uses 2999 reviews published by users on Metacritic between 2020-03-20 and 2020-05-03.

Figure 4.3 is a plot of the trend of user reviews on Metacritic over time.

Figure 4.3: Trend of user reviews

The astounding reception of Animal Crossing: New Horizons since its worldwide release on the 20th of March 2020 is reflected in the number of daily user reviews it has received. Figure 4.3 shows a huge spike in the number of reviews on the 24th of March 2020, lasting until about the 26th of March 2020, which may be attributed to the release of the game a few days earlier. The number of reviews thereafter remains consistent, apart from a smaller spike around the 28th of April 2020.

Figure 4.4 shows the most common words in the user reviews of the game. The words ‘game’, ‘island’, ‘switch’ and ‘play’ are the most common, and no direct positive or negative significance can be drawn from them.

Figure 4.4: The most used words in the user reviews

The user reviews also include a score from 0-10, where 0 is the lowest and 10 is the highest. Figure 4.5 is a plot of the distribution of these scores.

Figure 4.5: Distribution of user review scores on Animal Crossing: New Horizons

The most common user score is 0, while many other users score the game a 10. Almost all user scores fall at the two ends of the range, with very few reviews scoring the game a 5, 6 or 7. Given the recent hype about the game, the low review scores seem questionable and may even be the default score attached to a review when none is explicitly given. Therefore, it might be interesting to calculate the sentiment of each user review and relate it to the review's score, to identify any correlation between them.

A sentiment score between -5 and +5 is given to each user review, where -5 indicates a highly negative sentiment and +5 indicates a highly positive sentiment.
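
One way such a score can be computed is to look up each word of a review in a lexicon that rates words from -5 to +5 and average the matches. The Python sketch below illustrates that idea only. The mini-lexicon and its values are hypothetical, and averaging the matched words is an assumption about the method rather than a description of the report's actual calculation.

```python
import re

# Hypothetical mini-lexicon; the words and values are illustrative only.
TOY_LEXICON = {
    "love": 3,
    "great": 3,
    "fun": 2,
    "boring": -2,
    "bad": -3,
    "terrible": -3,
}


def review_sentiment(text, lexicon=TOY_LEXICON):
    """Mean lexicon score of the words in a review, or None if no word matches."""
    words = re.findall(r"[a-z']+", text.lower())
    scores = [lexicon[word] for word in words if word in lexicon]
    if not scores:
        return None
    return sum(scores) / len(scores)


print(review_sentiment("I love this game, great fun!"))
print(review_sentiment("Fun but bad"))
print(review_sentiment("One island per console"))
```

Because the result is a mean of values that each lie between -5 and +5, it always stays within that range. Reviews with no lexicon words get no score instead of a misleading 0.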

Figure 4.6 is a boxplot summarizing the sentiment scores of all 2999 user reviews published between 2020-03-20 and 2020-05-03.

Figure 4.6: Summary of sentiments of user-reviews

The boxplot in Figure 4.6 shows that the overall sentiment of the user reviews is only slightly positive at 0.4, which is surprising, as it would have been expected to be much higher given the game's recent popularity. Most review sentiments also lie within a range of -1 to +1, which may indicate that Animal Crossing isn't enjoyed by all and that there are as many users dissatisfied with or disliking the game as there are users enjoying it.

The mean sentiment score of each review is plotted against its review score in Figure 4.7.

Figure 4.7: Mean sentiment score of reviews against their review scores

The distribution of points in the plot shows no clear relationship between the sentiment of the review text and the review score.
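
A visual impression like this can be backed up with a correlation coefficient. The report does not state that one was calculated, so the Python sketch below is only an illustration of how Pearson's r could be computed for paired review scores and sentiment scores. The sample values are made up.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up pairs: review scores (0-10) and sentiment scores (-5 to +5).
sample_scores = [0, 10, 0, 10]
sample_sentiments = [1.0, 1.0, -1.0, -1.0]
print(pearson_r(sample_scores, sample_sentiments))
```

A value near 0, as in this made-up sample, would be consistent with no linear relationship, while values near +1 or -1 would indicate a strong one.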

4.4 What do the critics say?

To analyse the viewpoint of critics on Animal Crossing: New Horizons, reviews published by 107 critics such as Forbes, Telegraph and Nintendo Life between 2020-03-16 and 2020-05-01 are used.

Figure 4.8: Trend of critic reviews

Figure 4.8 shows the trend in the number of critic reviews over time. Most critics reviewed the game on the 16th of March 2020, just before its worldwide release, while a smaller number of critics published reviews in the days that followed.

Since critics have the ability to influence people through their comments, it might be interesting to see the most used positive and negative words in their reviews.

Figure 4.9: The most used positive words by the critics

Figure 4.10: The most used negative words by the critics

Figure 4.9 shows the most used positive words in the critic reviews, while Figure 4.10 shows the most used negative words. Judging by the number of terms in the two word clouds, positivity seems to outweigh negativity in the critics' reviews.

Figure 4.11 is a further breakdown of the words used by critics in their reviews, based on different emotions portrayed.

Figure 4.11: Breakdown of words used by critics into different emotional categories

As per Figure 4.11, most words used in critics' reviews are positive and reflect the emotions of trust, anticipation and joy. Very few words reflect the emotions of disgust, fear and anger, suggesting an overall positive response from critics to the Animal Crossing game.

The scores given by the critics range from 0-100, 0 being the lowest and 100 being the highest. Figure 4.12 shows the distribution of these scores across the critics' reviews.

Figure 4.12: The distribution of critics' scores on Animal Crossing: New Horizons

Notably, a significant percentage of the critic reviews score the game above 90, and all of the scores are above 70. Comparing Figure 4.5 and Figure 4.12, all the critics seem impressed with the game, while the users have mixed reviews.

References

“Animal Crossing: New Horizons Fish Guide: How, When and Where to Catch All the Fish.” n.d. GamesRadar+. Accessed August 26, 2020. https://www.gamesradar.com/au/animal-crossing-new-horizons-fish/.

“Animal Crossing Wiki.” n.d. FANDOM. Accessed August 26, 2020. https://animalcrossing.fandom.com/wiki/Fossils_(New_Horizons).

Fellows, Ian. 2018. Wordcloud: Word Clouds. https://CRAN.R-project.org/package=wordcloud.

Sievert, Carson. 2020. Interactive Web-Based Data Visualization with R, Plotly, and Shiny. Chapman and Hall/CRC. https://plotly-r.com.

Silge, Julia, and David Robinson. 2016. “Tidytext: Text Mining and Analysis Using Tidy Data Principles in R.” Journal of Open Source Software 1 (3). https://doi.org/10.21105/joss.00037.

Tierney, Nicholas. 2017. “Visdat: Visualising Whole Data Frames.” Journal of Open Source Software 2 (16): 355. https://doi.org/10.21105/joss.00355.

Wickham, Hadley. 2016. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. https://ggplot2.tidyverse.org.

Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019. “Welcome to the tidyverse.” Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.

Wickham, Hadley, Jim Hester, and Romain Francois. 2018. Readr: Read Rectangular Text Data. https://CRAN.R-project.org/package=readr.

Zhu, Hao. 2019. KableExtra: Construct Complex Table with ’Kable’ and Pipe Syntax. https://CRAN.R-project.org/package=kableExtra.